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Abstract. In this paper we are proving that Sawyer type condition for boundedness work 
for the two weight estimates of individual Haar multipliers, as well as for the Haar shift and 
other "well localized" operators. 



0. Introduction 

The main question of this paper concerns two weight estimates for singular integral oper- 
ators, i.e. the questions when an integral operator T is bounded operator from a weighted 
space L 2 {w) to L 2 {y). 

One of the most interesting cases is the case when T is the Hilbert transform, Tf(s) = 
- J jzjdt, although the case of more general Calderon-Zygmund operators seems to be of 
great interest as well. For such operators no "real variable" necessary and sufficient condition 
is knownHI 

In this paper we deal with dyadic analogues of the Hilbert transform, the so-called Haar 
multipliers and their generalizations. It turns out that for such operators it is possible to 
find necessary and sufficient condition for two weight estimates. 

Let us introduce one of the main examples. The standard dyadic grid V = V is the 
collection of all dyadic intervals [2 k ■ j,2 h ■ (j + 1)), k,j G Z. A general dyadic grid is an 
object obtained from X> by a dilation and a shift. 

For an interval I C Mwe define the (L 2 -normalized) Haar function h. := \I\^ 1 ^ 2 (x I ~X I )> 
here |/| stands for the length of the interval J; I + and J_ denote its right and left halves 
respectively, and \ E denotes the characteristic function (indicator) of the set E. 

Given a sequence a = {otj} IeT) define the Haar multiplier (a.k.a. the martingale transform) 
T a by 



lev 



We are interested under what conditions on the weights v, w (v, w > 0, v, w G Lj oc ) the 
operator T = T a extends to a bounded operator from L 2 {w) to L 2 (v), i.e., when the following 
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1 For the Hilbert transform a necessary and sufficient condition of Helson-Szego type was obtained by 
M. Cotlar and C. Sadosky [CSlj . Their condition was stated in the language of complex analysis: the 
Hilbert transform is a bounded operator from L 2 (w) — ► L 2 (v) if and only if one can find a function h in 

the analytic Hardy class H 1 such that for some C > the matrix ( J~" W V -r 1 J is positive 



Cw + v — h Cw — v 

scmidcfinitc a.c. No generalization of this condition to other Calderon-Zygmund operators is known 
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two weight estimate 

[ \T a f\ 2 vdx < C I \f\ 2 wdx 
Jr Jr 

holds. 

If one assumes that w~ x G L\ oc (this assumption is satisfied in practically all interesting 
cases), then the above estimate means that the operator M v i/2T a M w -i/2, where stands 
for the operator of multiplication by 0, is bounded in the non-weighted L 2 space. 

In |NTV1] this question was studied for the family of Haar multipliers T aa , where a = 
{oCj}j v is fixed and o = {Cj} Jg:D is an arbitrary sequence of signs, a I = ±1. It was 
shown that the operators M v i/2T aa M w -i/2 are uniformly (over all possible choices of signs a) 
bounded if and only if the operators are uniformly bounded on the test functions w~ l ^ 2 x v 
I G V, and their adjoints M w -i/2T aa M v i/2 are uniformly bounded on the test functions 

v l ' 2 Xr lev. 

Conditions where the boundedness of an operator follows from its boundedness on special 
test functions as above are called "Sawyer type conditions" , after E. Sawyer, who proved in 
|S1] that such condition is necessary and sufficient for two weight estimates of the maximal 
function (only one condition is needed there). After that there were quite a few results that 
Sawyer type conditions are sufficient for the boundedness of (clearly they are necessary) of 
different classes of integral operators with positive kernels. 

The above mentioned result about Haar multipliers was a first (and up until now a unique) 
result giving necessary and sufficient condition for the boundedness of operators with alter- 
nating kernels. The fact that helped us a lot was that we were dealing with a family of 
operators, and that allowed us to reduce the problem to estimates of an operator with non- 
negative kernel. 

At that moment it seemed impossible to get a sharp result about two weight estimates for 
an individual Haar multiplier, at least our method did not allow us to do that. 

So, the main result of the paper looks quite surprising: the Sawyer type condition works 
for two weight estimates of an individual Haar multiplier (as well as for a wider class of 
so-called "band operators")! 

Before giving a formal definition, let us present one more important example of an "band 
operator" , the so called Haar Shift S 

s f ==£(/> mv-^J- 

lev 

This operator is interesting, in particular, because if we average it over all dyadic grids 
(translated and rescaled) we get the Hilbert transform (up to a multiplicative constant), see 

n 

1. TWO WEIGHT ESTIMATES OF BAND OPERATORS: FORMAL DEFINITIONS AND MAIN 

RESULTS 

1.1. Main definitions. Let D be the standard dyadic lattice in WL N . Let us recall the 
definition: for each k G Z we consider the cube [0, 2 h ) N and all its translations by elements 
of M. N with coordinates of form j ■ 2 k , j G Z, then take union over all k. Each dyadic cube 
has 2 N "sons" , and each cube has a unique "parent" . 

In this paper by a cube we will always mean a dyadic cube. 
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For a cube Q let £(Q) denote its size, i.e. the length of its side. And for a cube Q let Q 
be the kth grandparent of the cube Q, i.e. the cube R of size 2 k £(Q) containing Q. 

A (non- weighted) Haar function on a cube Q is a function hg supported on Q, constant on 

the "children" of Q and orthogonal to constants, Jg h^dx = 0. The set of all Haar function 

on a cube Q C R is a vector space of dimension 2^ — 1. 

With a dyadic lattice T> in M. N one can naturally associate a (non-oriented) 2 7V -ary tree, 
where each dyadic cube is connected to its 2 N "sons". By the "tree distance" (or "graph 
distance") d tm e(Q,R) between cubes Q,R G V we understand the distance on the graph, 
where we assign length 1 to each wedge. 

And now the formal definition: 

Definition 1.1. A band operator on R is a bounded operator in L 2 (IR d ) whose matrix in the 
Haar basis has the "band structure", meaning there exists reZ + such that (Th Q , h R ) = 
for all Haar functions Iiq, Iir such that d tree (Q, R) > r. 

The Haar multipliers (martingale transforms) discussed in the introductions are band 
operators with r = 0, and the Haar shift is a band operator with r = 1. 

1.2. Two weigh estimates. As we already mentioned above, the operator T can be ex- 
tended to a bounded operator acting from L 2 (w) to L 2 (v) if and only if the operator 
can be extended to a bounded operator in the (non- weighted) L 2 . Denot- 
ing u := w' 1 we can write the problem in more symmetric form, namely, when the operator 
can be extended to a bounded operator in the non- weighted L 2 ? Here we assume 
that u and v are L\ oc weights. This formulation is (at least formally) a bit more general then 
the formulation with two weight estimates, because here we do not assume that u^ 1 G L\ oc 
(and we did assume above that w^ 1 G Ll oc ). 

Now we are ready to state one of the main results of the paper: 

Theorem 1.2. Let T be a band operator, and let u, v > 0, u, v G L] oc . Then the operator 
M^ 2 TMu 2 extends to a bounded operator in L 2 if and only if for all dyadic cubes Q the 
following two inequalities hold: 

(1.1) / \T{ Xo u)\ 2 vdx < C udx, 

Jr n q Jq 

(1.2) / \T*( Xo v)\ 2 udx<C [ vdx. 

1 /2 1/2 

Moreover, the norm of the operator M v TM U can be estimated by a constant depending 
only on the dimension N, the number r in the definition of the band matrix, and the constants 

c m flu}, (H2D 

The above theorem says that to verify the boundedness of the operator Ml^ 2 TMu 2 it is 
sufficient only to check its boundedness on the test functions u 1 ^Xq an d the boundedness of 

its formal adjoint Mu 2 T*Ml^ 2 on the test functions v x I 2 Xq fo r all dyadic cubes Q. 

Remark 1.3. A careful reader can ask a question here on how one can interpret the expressions 
T(x u ) an d T*(x v). The problem is that the operator T (and so its adjoint T*) are defined 
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only on L 2 , so if one only assumes that u, v G L\ oc , the above expressions formally are not 
defined. 

However, it is easy to give the meaning to the conditions (II. ip . (II. 2p . First of all, T(Xq u ) 
and T*(Xq v ) are we h defined if u, v G L 2 oc . Second, it is trivial, that for any weight v ' < v 
we have \\M^/ 2 TMu || < || M^TM^!!, and by taking adjoint one can conclude that for any 
weight u'<iiwe have \\Ml ,2 TM l J\\ < \\mI /2 TMI' ||. 

Therefore, for any sequence of weights u n , u n G L 2 (Q), u n f u, the expression Tu n Xg is 

well defined. If, in addition, one assumes that the operator M^ 2 TMu 2 is bounded, then 
Wv^-^TunXg ||l 2 < C < oo, and moreover v 1 / 2 Tu u Xq converges to some functions in L 2 . It is 
easy to see that this limit function does not depend on the choice of the sequence u n , so we 
call this function v l ^ 2 Tx n u. The right side of (11.11) then can be interpreted as the L 2 norm 
of this function. The dual condition (11.21) can be interpreted similarly. 

Remark 1.4. For the sufficiency part one can think even of a simpler condition to interpret 
(11.11) . (II. 2p . For example, one can pick an increasing sequence of weights u n G L 2 oc /* u (for 
example u n (x) = min{u(x), n}) and interpret the condition (11.11) as the uniform estimate 
(independent of n and Q) 

/ \T(x u n )\ 2 vdx < C udx 

The dual condition (II .2p is interpreted similarly by picking an increasing sequence v n /* v. 

Indeed, these conditions would imply the conditions (II. ip . ( 11.21) with u and v replaced by u n 
and v m respectively. That, by Theorem (II. 2p implies the uniform estimate ^Ml^TMu^W < 
C < oo which implies the estimate ||My 2 TM,y 2 || < C. 

The conditions of Theorem 11.21 can be relaxed a bit by integrating only over the cubes Q: 

Theorem 1.5. Let T be a band operator and let u, v > 0, u, v G L] oc . Then the operator 
M^ 2 TMu 2 extends to a bounded operator in L 2 if and only if for all dyadic cubes Q the 
following two conditions hold: 

(1) For all dyadic cubes Q, R satisfying 2~ r £(Q) < £(R) < 2 r £(Q) 

(2) For all dyadic cubes Q 

/ \T(Y„u)\ 2 vdx < C udx, / \T*(Y„v)\ 2 udx < C vdx. 

Jrn 1 VA « - Jq Jrn 1 VA « " J Q 

1 /2 1/2 

Moreover, the norm of the operator M v TM U can be estimated by a constant depending 
only on the dimension N, the number r in the definition of the band matrix, and the constants 
C in the above conditions (1) and (2). 

In this paper (•, •) stands for the standard inner product in L 2 (M. N ), (f,g) = f fgdx. 
Note that 

(Tx Q u,X R v) = {Mll 2 TMH 2 x Q u xt2 ,x R v 112 ), 
so, as we discussed above in Remark [l~3l this expression is well defined. 
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Remark 1.6. Note, that in fact one does not even need the operator T to be bounded in 
L 2 . A bit more elaborate reasoning than in Remark 11.31 would allow us to interpret the 
conditions of Theorem 11.21 in the case when we only require the bilinear form (Tf, g) of the 
operator T be defined on bounded compactly supported functions / and g. We leave the 
details here as an exercise for the reader. 

2. Well-localized operators for general measures 

To prove theorems 11.21 and 1 1 . 51 we prove even a bit more general results about the so-called 
well-localized operators T = : L 2 (fx) — > L 2 (v) for general measures /x, v that may have 
singular parts. 

2.1. Heuristics and formal definition. The idea behind the notion of well-localized op- 
erators is rather simple. If one has an integral operator T, 

T f(x) = J K(x,y)f(y)dm(y), 

where the integration is with respect to the Lebesgue measure m in M. N , one can consider 
(at least formally) the operator where the integration with respect to the measure fx, 

T nf( x )=f K (x,y)f(y)dfx(y). 

Such reduction is quite common in weighted estimates. For example, two weight estimates 
for integral operators T can be reduced to estimates of operators T M : L 2 (fx) — > L 2 {y) with 
appropriately chosen measures fx and v. One of the advantages of such representation is that 
the integration in the operator and in the computation of the norm is with respect to the 
same measure fx. 

Since a rank one operator ( • , f)g is an integral operator with kernel g(x)f(y), one can 
formally represent any operator T is L 2 as an integral operator by considering its matrix in 
an orthonormal basis (for example in the Haar basis) and writing it as a sum of rank one 
operators. And then one can replace the integration with respect to the Lebesgue measure 
by the integration with respect to the measure fx. However this is only a formal reasoning, 
because, first of all, the resulting "kernel" of the "integral" operator does not need to be a 
function. And even if the kernel K is a function, it is completely not clear how to interpret 
the operator T M . 

So, instead of writing a band operator as an integral operator (which is not always possible, 
for example the identity / is a band operator) and then trying to interpret the operator T M , 
we pick an axiomatic approach. 

Looking at the above formal reasoning, one can figure out what structure the matrix of 
the operator T M should have with respect to the weighted Haar bases in L 2 (/x) and L 2 [y\ if 
one start with a band operator T. It is not difficult to see, that while the matrix of T M in 
the weighted Haar bases is not generally a band matrix, it has some special properties, some 
traces of the band structure of T. 

So, we took this special structure of the matrix in the weighted Haar bases as the definition 
of the well-localized operators. Later in this section we will show how for a band operator 
T one can rigorously reduce the estimates of the operator mV 2 TMu 2 to the estimates of of 
the appropriate well-localized operator 
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And now main definitions. 

First, let us define the weighted Haar system. For each cube Q a Haar function with 
respect to a measure // (/i-Haar function) on Q is a function fig supported on Q, constant 
on all 2 N "children" of Q, and such that hg dfi = 0. Note that for a given Q the set Hq 

of all /i-Haar functions on Q is a subspace of dimension at most 2 N — 1 (can be less since 
degenerate situations are possible). The set Hq of non- weighted Haar functions on Q has 
exactly dimension 2^ — 1. 

Let fj, and v are (regular Borel) measures in M. N . Let us be given an operator T = T M 
acting from L 2 (fi) to L 2 (u). By given we mean that we know its bilinear form (TXqiX r )v 
on characteristic functions of cubes; here ( • , • )„ is the inner product in L 2 (i>), 




Note that the above bilinear forms also define a formal adjoint T* = T* of T = T^. 

Definition 2.1. Let T = T M be he operator defined above, acting (formally) from L 2 (fi) to 
L 2 (u). We say that T M is lower triangularly localized if there exists a constant r > such 
that for all cubes R and Q, £(R) < i(Q), and for all z/-Haar functions h R on R 

ifR<£ Q {r) or if £(R) < 2- r £(Q) and R(JLQ. Here, recall, is the "grandparent" of order 
r of the cube Q. 

And we say that the operator T M is well localized if both T M and its formal adjoint T* are 
lower triangularly localized. 

Note, that the Haar multipliers and Haar shift, discussed in the Introduction are well 
localized in the sense of the above definition, see Section 12.21 below. Note, that while the 
matrix of a Haar multiplier T a in the non-weighted case has only one diagonal, the matrix 
of the weighted version may have infinitely many. 

2.2. From weighted estimates for band operators to estimates of well localized 
operators. First we want to show that two weight estimates for the band operators can be 
reduced from the estimates for the well-localized operators. 

Let T be a band operator in the non-weighted L 2 (IR iV ) and let u, v > 0, u, v G L\ oc be two 

1/2 1/2 

weights. Suppose we want to know whether the operator M v TM U is bounded. 

Let us denote dv = vdm, dfi = udm, where m is the Lebesgue measure in IR . Note that 
M u i/2 : L 2 (/i) -> L 2 (m), M v i/a : L 2 {v) — > L 2 (m) are isometries. So the boundedness of the 
operator mI /2 TMI /2 in L 2 (m) is equivalent to the boundedness of the operator 

T, = M v - U ,{M l J 2 TM l J 2 )M u ^ = TM U 

acting from L 2 (n) to L 2 {y). 

If the operator T is an integral operator 

Tf(x)= K{x,y)f(y)dm(y), 
Jr n 
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then the operator T M is the integral operator with the same kernel K, but the integration is 
performed with respect to the measure dfi = udm, 

T »f( x ) = K(x,y)f(y)d^(y). 

The adjoint of the operator : L 2 (fi) — > L 2 (v) is the operator T* : L 2 {y) — > L 2 (fi), 

T* = T*M V . 

Again, in the case of integral operator, we have the following representation of T*. 

T u9(y) = K(x,y)g(x)dv(x), 
which explains the subscript v. 

Remark. The formula T* = T*M V may seem strange, it looks like the formula should be 
M U T*. However, this is a correct formula, and the naturally looking formula M U T* is wrong, 
the main reason being that we are considering operators acting between different spaces. 
Namely, if we represent T M as 

Tfj, = M v - 1/2 [M vl/2 TM ul/2 ]M ul/2 

then in brackets all operators are operators in L 2 , so multiplication operators are self-adjoint 
ones. But outside the brackets, the operators M v -ys and M u i/ 2 are unitary operators, 
M v -i/2 : RanM„i/ 2 C L 2 — > L 2 (v) and M u i/ 2 : L 2 (fi) — > RanM u i/ 2 C L 2 . So their adjoint 
are their inverses, and so 

(T M )* = M u ^ /2 [M ul/2 T*M vl/2 }My 2 = T*M V =: T* v . 

Let us now show that the operator T M is indeed a well localized operator in the sense of 
Definition [2H 

Consider a decomposition of the operator T with respect to the non-weighted Haar basis 
in L 2 (m), 

T= Yl T R,Q> T R,Q : H Q Hr ' 
R,Qev 

here recall that Hq denotes the space of all non-weighted Haar functions on the cube Q 
(which is a subspace of L 2 (m) of dimension 2^ — 1). If we chose some orthonormal bases 
{/i<2fc}fc=i and {h R .} 2 _[ 1 in Hq and Hr respectively, then we can represent Tr ; q as 

2^-1 

T R,Q = S ( Th QV h Rj)(' ' h Q,k) h R,f 
j,k=l 



Since the rank one operator (• f)h R . is an integral operator with kernel /i Q k (y)h R ,(x), 
we can conclude that T R q can be represented as an integral operator with bounded compactly 
supported kernel. So, the operators T R q are well defined on L\ oc functions. 
Assume for a moment that u G L 2 oc . Then ux n £ L 2 , so 



Q',R' 
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where the series converges in L 2 . Consider a cube R, £(R) < £(Q). If we also assume that 
v G L 2 oc , then (Tux Q , h R )v = (TuxQ,vh u R ) is well defined and 

(2.1) {Tvx Q ,h" R ) v = £ (T R , iQ ,X Q u,h R )„ 

Q',R'ev 

Note, that T R , q,Xq u only if Q' fl Q ^ 0, so in the above sum we need to consider 

only such Q' . Since a weighted Haar function h R is orthogonal in l?[y) to constants, 
(T R/ Qf x Q u,h u ) v = for R C R> . Also, trivially {T RIQ ,x Q u,h v ) v = if # n R ^ 0. 
And finally, the band structure of T means that T R , , = if d tree (R',Q') > r. So in the 
above sum we can consider only R', Q' satisfying Q' fl Q ^ 0, R' C R and d tTee (R, Q') < r. 

Since R' C R and < £(Q), the assumption dtree(-R', Q') < r implies that £{Q') < 
2 r £{Q). This together with QnQ' ^ implies that Q' C Q (r) . Indeed, QnQ' ^ means 
that either Q' C Q (and so C Q^), or Q C Q', and in the latter case the estimate on the 
size of Q' implies Q' C Qv\ 

Note that if R <£ QW then R £ QW (because R' C R). Therefore, d tree {R', Q') > r + 2, 
because we need at least one step to go above Q^ r ' and then at least r + 1 steps to go to 
cubes of size £{R). Therefore, (T^" 1 ^), h v R ) v = if R £ QM. 

A similar reasoning works for the case £{R) < 2~ r £(Q). Namely, the inclusion R' C R 
and the inequality d tvee (R, Q') < r imply that £{Q') < £(Q), so it follows from Q fl Q' ^ 
that Q' C Q. If R <f. Q then R Q and d tree (i? / , Q') > r + 2 (we need at least one step 
to go from Q' above Q, and then at least r + 1 steps to go to the cubes of size 2~ r £(R)). 
Therefore, (T^Xq), h R ) v = if £{R) < 2~ r £{Q) and R<£Q. 

So, the operator T M is lower triangularly localized, and the same reasoning can be applied 
to the adjoint operator T*. So we have shown, that under the assumption u, v G L 2 oc the 
operator T M obtained from the band operator T is well localized. □ 

To treat the general case let us note that for Q fl R = the above sum (12 .ip has only 
finitely many terms. Each operator T Rl is an integral operator with bounded compactly 

supported kernel, so (Tx n u, h v ) v is well defined for u, v G L\ . 

Moreover, if we take increasing sequences of weights u n ,Vk G L 2 oc1 u n f u, Vk f v, and 
define dvk = Vkdm, then 

lim \im{T Xo u n ,h^) Vk = (T XQ u,h R ) u = (M vl/2 TM ul/2 x Q u 1/2 ,h R v^ 2 ), 



+oo k 



where the function M v \/2TM u \/2XqU 1 ^ 2 is interpreted exactly as in the Remark 11.31 

2.3. Sawyer type results for well localized operators. The following two theorems can 
be also considered the main result of the paper. 

Theorem 2.2. Let T = T M be a well localized operator acting (formally) from L 2 (n) to 
L 2 [y). Then is a bounded operator from L 2 (fi) to L 2 (v) if and only if T and its formal 
adjoint T* are uniformly bounded on characteristic functions of cubes, i. e. iff 

\Kx Q \? L ^<c\\x Q f L ^= c ' ,{Q) ' VQeP ' 
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Moreover, the norm of T can be estimated by a constant depending only on the dimension 
N, the above constants C and r from the definition of well localized operator. 



Theorem 11.21 is an immediate corollary of this theorem. 

The assumptions that T M and T* are uniformly bounded on the characteristic functions 
of cubes can be relaxed a little: one does not have to integrate T^Xq over the whole space. 
Namely, Theorem 12.21 can be restated as follows 



Theorem 2.3. Let T = 7] t be a well localized operator acting (formally) from L 2 {^i) to 
L 2 {y). Then T M is a bounded operator from L 2 ([i) to L 2 (u) if and only if the following two 
conditions hold: 

(1) {T^x Q ,X R )u < C^Qf^^R) 1 ' 2 for all cubes Q, R of comparable size, 2~ r £(Q) < 

£(R) < 2 r £(Q); here r is the number from the definition of well localized operator. 

(2) For all cubes Q 



T »X C 



dv < Cfi(Q), 



v A.( 



dv < Cv{Q). 



Moreover, the norm of T can be estimated by a constant depending only on the dimension 
N , the above constants C and r from the definition of well localized operator. 



Theorem 11.51 is an immediate corollary of the above Theorem 12.31 

Note that in the condition (2) of the theorem one can replace T^Xq by its orthogonal (in 
L 2 (v)) projection onto the subspace of functions F with zero average over Q, fgfdv = 0, and 

similarly for T*Xn- The condition (1) implies that (T^XqiXq)v < C(n(Q)) l l 2 (v(Q)) l l 2 , 
i.e. that the projection onto the orthogonal complement of this subspace is bounded. 

While in our main example the measures // and v are absolutely continuous, and the 
operator T came from an operator in the non- weighted L 2 , in Theorems 12.21 and 12.31 the 
measures are arbitrary regular Borel measures and is an arbitrary operator L 2 (n) — > 
L 2 (v). We only need to know its bilinear form (T^Xq, X r )u to apply the theorems. 

Remark 2.4. We do not have to assume that the operator T is bounded in the non-weighted 
L 2 to be able to apply Theorem 12.21 and 12.31 to get the weighted norm inequalities for T. 
We only need to be able to define {T^Xq, X r )v m Section I2T21 above we had shown how one 
can treat this expression if the operator T is bounded in the non- weighted L 2 . The same 
reasoning will work if T is bounded in any non- weighted LP . 

In general situation (for example, if we have unbounded Haar multipliers) some care is 
needed to interpret i (T 11 Xq-,X r )v, but after that one can apply Theorem 12.21 and 12.31 



3. Proof of Theorems 12.21 and 12.31 

3.1. Martingale difference decomposition. Denote by = E% the averaging operator 
in L 2 (fi) over dyadic cubes of size (length of the side) 2 fe , namely E£f(x) = /i(Q)" 1 f Q fdfi, 

where Q is a dyadic cube of size 2 k containing x (for the sake of definiteness, we consider 
cubes of the form xq + [a, b) N ). If Q is a cube of size 2 k , we denote by f the restriction 

of Elf to Q: EM := (/i(Q)" 1 L fd^) X 



X Q E kf- 
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Let A fc = A£ := El_ x - E%. Again for a dyadic cube Q of size 2 fc , denote by A£/ 
the restriction of A^f to Q. Clearly, for any / G L 2 (fi), the functions Aq/, Q E V, are 
orthogonal to each other, and that for any fixed n we have the orthogonal decomposition 

(3-1) /= E A Qf+ E E & 

Qev,e(Q)<2™ Qev,e(Q)=2" 

\\f\\W)= E ii a q/ii 2 + E \\ E Qf\\ 2 - 

Qev,e(Q)<2 n Qev,e(Q)=2 n 

3.2. Paraproducts. Define the paraproduct IP = ITy, acting (formally) from L 2 (fi) to 
L»by 

nV:=E^/ E A ^ Q - 

Qe£> i?ex>, -RcQ, 
e(R)=2- r e(Q) 

The paraproduct IF = 11^* is defined similarly, 



IT/ E A n T ^Q- 



Qev Rev, RcQ, 
e(R)=2- r e(Q) 

Remark 3.1. Note, that it follows from the definition of well localized operator that if R C Q, 
and £{R) < 2- r £{Q), then for any Q' D Q 

A^ Xq; =A^ Xq . 

In other words, in the definition of II we can always replace Xq by Xq> where Q' is a bigger 
cube. 

That essentially means that formally we can write Tl instead of Tx Q , so the definition is 
more in line with the standard definition of a paraproduct. 

The following lemma describes the matrix of II with respect to the weighted Haar systems 
in L 2 (fi) and L 2 [y). 

Lemma 3.2. Let Q, R be dyadic cubes. Then for the paraproduct U — IP defined above 

(1) If£(R) > 2- r £(Q) then (n"/i£, h R ) v = for all weighted Haar functions h% and h u R . 

(2) If R(jLQ, then (IP/ig, h R ) u = for all weighted Haar functions hg and h R . 

(3) If £{R) < 2~ r £{Q), then for all weighted Haar functions hg and h R 

(Wh^h R ) v = {Th^h R ) v] 
in particular, if R(jL Q, then both sides of the equality are 0. 
Proof. Let us use Q' and R! for the summation indices in the paraproduct, i.e. let us write 

n%-EW E a r> t »x q , 

Q'ev R'ev, R'cQ', 

e(R')=2- r e(Q') 

Since h R is orthogonal to ranges of all projections A RI except A^ we can write 
(3.2) (I%,h&„ = ({E^ Q )A R T X h R ) v = a(T X h R ) v 
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where Q' is the grandparent of R of order r (i.e. the cube Q', Q' D R and such that 
£(Q') = 2 r £(R)) and a is the value of Eg/kg on Q', Eg/kg = clXq,- 

It is easy to see that EqMq ^ (equivalently a ^ 0) only if Q' ^ Q. Therefore, see (13.21) . 
(Il/ig, h R ) u 7^) only if Q' ^ Q and statements 1 and 2 of the lemma follow immediately. 

Indeed, if £(R) > 2- r £(Q) and i{Q') = 2 r £(R), the inclusion Q' C Q [ s impossible, so 
(il/ig, h R ) u = 0, and the statement (1) is proved. 

If R <£. Q, then the inclusion Q' C Q implies that £ Q'. But £(R) = 2- r £(Q'), so by the 
definition of a well localized operator (Hhg, h R ) u = and the statement (2) is proved. 

Let us prove statement 3. Let £{R) < 2~ r £(Q). If R (£_ Q then by the statement 2 of the 
lemma (Il/ig, h v ^) v = 0, and (T^hQ,h u R ) u = by the definition of well localized operators 
("children" Qk of Q are cubes of size at least 2 r £(R), and it follows from the definition of 
well localized operator that (T^XQ k , h^) u = 0). So, we only need to consider the case R C Q. 

Let Qi be the "child" of Q containing R (i.e. R C Q x C Q, £{Q X ) = £(Q)/2), and let a 
be the value of fig on Q\. Then, since for all other children of Q the definition of well 
localized operator implies {Tx n , h^) u = 0, we can conclude that 

( Th Q, h R)u = a(Tx Qi ,h u R ) u 

On the other hand we have shown before, see (13.21) that 

(W Q ,h R ) u = ((E^ Q )A R Tx Q ,,h R ) u 

where R C Q' g Q, £{Q') = 2 r £{R). Therefore Q' C Q x and so E^,hg = ax Ql - We also 
know, see Remark 13.11 that because Q' C Q\ we have equality A R Tx Q , = A r TXq 1 ■ Thus 
we can continue: 

(n/iQ, h v R ) v = a(A R Tx Q , ,h R ) u = a(A R Tx Qi ,h R ) u = a(Tx Qi , h R ) v . 

Therefore (II/iq, h R ) u = {Thg, h R ) u , and the lemma is proved. □ 

It follows from Lemma 13.21 and the definition of a well localized operator, that in the 
weighted Haar bases the matrix of the difference T^— 11^ — (LT^*)* has finitely many diagonals, 
which are bounded by the assumption (1) of Theorem 12.31 Therefore, the main part in the 
proof Theorem 12.31 (and thus Theorem 12.21) is to prove that the paraproducts are bounded. 
Of course, one also need to take care of the terms like E^ f in the decomposition (13. II) . but 

this is ratther simple, we present the details in Section I3~^E1 below 

3.3. Boundedness of the paraproduct. We will need the following well known theorem. 
Let f R := J R f d\i be the average of the function / with respect to the measure /i. 

Theorem 3.3 (Dyadic Carleson Embedding Theorem). // the numbers > 0, Q G T> 
satisfy the following Carleson measure condition 

(3.3) ^2a Q <fi(R), 

QCR 

then for any f G L 2 (fi) 

E a Hi4i 2 <cimiW) 

Rev 

where C is an absolute constant. 
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This theorem is very well known, cf [Dur70| . For an alternative prove see also |NTV] . 
[NTJ , |NTV3j , where it was proved with the constant C = 4 using Bellman function method. 
It was also proved in [NTV3j that the constant C = 4 is optimal. We should mention, 
that in |NTj . [NTV3] this theorem was proved for R 1 , but the same proof works for general 
martingale setup. A proof for R 2 was presented in |NTV] . and the same proof works for R . 



Let us now show that the paraproduct IT = IT^ is bounded. Ranges of the projections 8 y R 
are mutually orthogonal, so to prove the boundedness of the paraproduct IT^ it is sufficient 
to show that the numbers 



E w^xjhiu) < 



R£T>,RcQ 
£(R)=2- r £(Q) 



satisfy the Carleson Measure Condition (I3.3P from Theorem 13.31 Let us prove this. 
Consider a cube Q. We want to show that 

E E W^X Q \\h {v) <CKQ). 
QcQ R^v,rcq 

i{R)=2- r i{Q) 

By Remark [3.11 we can replace x n by X~> so the desired estimates becomes 

E \\^R T vXq\\l 2 (u) < E \\^R T vXq\\l 2 (v) < Wx^T^X-Wh^y 
ReV,RcQ_ RcQ 

e(R)<2- r e(Q) 
By assumption (2) of Theorem 12.31 

\\x^x~\\h {u) := J \T,X-\ 3 dv < Cfx(Q) 

and so the sequence a , Q eV satisfies the condition (13. 3p □ 



3.4. Why T is bounded. Let / G L 2 (/i), g £ L 2 (u). 
We want to estimate 

\(Tnf,g) v \ < C\\f\\ L 2 {fl) \\g\\ L 2 {u) . 

It is sufficient to prove the estimate on a dense set of compactly supported functions. Each 
compact subset of M> N is contained in at most 2 N cubes of the same size as the size of this 
compact subset, so let Qk, k = 1,2, . . . ,2 N be cubes of size 2 d containing supports of / and 
9- 

Let us decompose / and g using the martingale difference decomposition, 
f = H E QJ+ E A £/' 9 = T,Eq>9+ E A Q9- 

k=l Q:£(Q)<2 d k=l Q:l{Q)<2 d 

Note, that Agf and A Q g are /i- and z/-Haar functions on the cube Q. 
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By Lemma 13.21 



+ E E (r M A5/,A^>„ 

Q:f(Q)<2 d \R:2- r e(Q)<£{R)<2 r e(Q) 

We know that the paraproducts lT^, and IT^* are bounded, so it remains to estimate the 
last sum. It follows from the assumption (1) of Theorem 12.31 that | (T^Agf, A u R g) v \ < 
C\\A»f\\ L2{lx) || A R g\\ LHu) if 2-*£(Q) < £(R) < 2 r £(Q). 

On the other hand, it follows from the definition of a well localized operator, that given 
a cube Q at most M terms (T^Agf, A u R g) v are non-zero, where M does not depend on the 
choice of Q. Therefore we can split the sum into at most M sums of form (operators with 
"one diagonal") 

J2(T^ Q f,A» R{Q) g) u , 
Q 

which can be estimated by Cauchy-Schwarz inequality. 

It remains to estimate the terms involving Eq. By assumption (1) of Theorem 12.31 
\( T » E Qji E Qj9)\ - C \\x Qk \\LHpL)\\x Q \\tf(v) < CH/IU^Hfl'IU^), and since there are at most 
2 N cubes Qk we can estimate 

El( r ^Q fc /^Q^),|<2 2JV C||/|U 2(M) ||g|U 2M . 

k,3 

Let us now estimate 

E|EWJ' A ^>, 

k R 

For a fixed Qk we have by the assumption of Theorem 12.21 

E (T,E»J,A R g) u \ < l7||/|| l2(m) ||s||l». 

Rl(R)<i(Q k ) 

But we have at most 2 N cubes Qk, so the whole sum is bounded by 2 N C\\ f\\L 2 (u)\\9\\L 2 (v)- 



The sum 



ElE^^^ 



can be estimated similarly. 



k R. 

□ 
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